Speaker Recognition with Spectral Dimension Features of Human Voices for Personal Authentication

Author

  • Wen-Shiung Chen
Abstract

Biometric recognition is becoming increasingly important for security applications worldwide, and mobile phones have become widespread in recent years. Voice-based recognition of a speaker's identity on mobile devices is therefore a promising approach to personal authentication. This paper presents a speaker recognition method that combines a non-linear feature, named spectral dimension (SD), with Mel Frequency Cepstral Coefficients (MFCC). To improve the performance of the proposed scheme, the Mel scale is used to allocate sub-bands, and pattern matching is performed with trained Gaussian mixture models (GMM). Some problems related to the spectral dimension are discussed, and a comparison with other simple spectral features is made. We observe that the proposed methods improve performance in different components. For instance, speaker verification combining MFCC with the proposed SD features achieves an equal error rate (EER) of 2.31% with 32_Multi-GMM, compared with an EER of 2.96% for the method based on MFCC alone, a relative improvement of about 22%.

Index Terms: Biometric Recognition, Personal Authentication, Speaker Identification, Speaker Verification, Fractal Dimension, Spectral Dimension.
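The relative improvement quoted in the abstract follows directly from the two EER values. The sketch below reproduces that arithmetic and, for orientation, shows one simple way an EER can be estimated from verification scores. It is not the authors' evaluation code: the function names, the threshold sweep, and the convention that a higher score means "more likely the claimed speaker" are assumptions made for illustration.

```python
def relative_improvement(baseline_eer, new_eer):
    """Relative reduction in error rate, as a fraction of the baseline."""
    return (baseline_eer - new_eer) / baseline_eer


def equal_error_rate(genuine_scores, impostor_scores):
    """Approximate the EER by sweeping a threshold over all observed scores.

    Assumption: a trial is accepted when its score is >= the threshold.
    Returns the mean of the false acceptance and false rejection rates
    at the threshold where the two rates are closest.
    """
    best_gap, eer = None, None
    for t in sorted(set(genuine_scores) | set(impostor_scores)):
        # False rejection: genuine trials scoring below the threshold.
        frr = sum(s < t for s in genuine_scores) / len(genuine_scores)
        # False acceptance: impostor trials scoring at or above it.
        far = sum(s >= t for s in impostor_scores) / len(impostor_scores)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


# EER values taken from the abstract: MFCC only vs. MFCC combined with SD.
print(f"{relative_improvement(2.96, 2.31):.1%}")  # prints 22.0%
```

With made-up scores such as `equal_error_rate([0.9, 0.8, 0.7, 0.4], [0.6, 0.3, 0.2, 0.1])`, the sweep finds a threshold at which both error rates equal 0.25. A production evaluation would normally interpolate between thresholds rather than pick the nearest observed score.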


Similar resources

Automatic speaker recognition as a measurement of voice imitation and conversion

Voices can be deliberately disguised by means of human imitation or voice conversion. The question arises to what extent they can be modified by using either method. In the current paper, a set of speaker identification experiments is conducted; first, analysing some prosodic features extracted from voices of professional impersonators attempting to mimic a target voice and, second, using both...


Robust Speaker Verification over Narrowband and Wideband Communication Channels

Modern speaker recognition applications involve the authentication of users by their voices. A wide range of systems requires reliable personal recognition techniques to either determine or confirm the identity of a person requesting some type of their services. The main purpose of these techniques is to ensure that provided services are accessed only by a legitimate user and no one else. Voice...


Convolutional neural network with adaptive windows for speech recognition

Although speech recognition systems are widely used and their accuracy is continuously improving, there is a considerable performance gap between their accuracy and human recognition ability. This is partially due to high speaker variation in the speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...


Classification of emotional speech using spectral pattern features

Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...


Robust speaker identification against computer aided voice impersonation

Speaker Identification (SID) systems offer good performance in the case of noise free speech and most of the on-going research aims at improving their reliability in noisy environments. In ideal operating conditions very low identification error rates can be achieved. The low error rates suggest that SID systems can be used in real-life applications as an extra layer of security along with exis...



Publication date: 2015